Comparable English - Russian Book Review Corpora for Sentiment Analysis
Abstract
This paper presents newly designed comparable corpora of book reviews in two parts, Russian and English, representing two very different languages. The corpora are comparable in terms of domain, style and size. This set of corpora may be of use for cross-lingual experiments in document-level sentiment classification. We also present a brief description of the language- and domain-specific issues observed in these corpora.
Similar resources
Collocational patterning in cross-linguistic perspective: adpositions in English, Nepali, and Russian
This paper presents a contrastive analysis of adpositions in English, Nepali and Russian corpora. Two sets of highly frequent adpositions, those with broadly locative and broadly ablative meaning, are contrasted. The ‘quantitative-distributional’ analysis is based on identifying patterns across the most statistically significant collocations of the words in question; it is undertaken using 1 mi...
Russian-English, English-Russian Dictionary on Probability, Statistics, and Combinatorics (K. A. Borovkov)
Read more and get great! That's what the book Russian-English, English-Russian Dictionary on Probability, Statistics, and Combinatorics will give every reader who reads it. This is an online book provided on this website. Even as this book becomes someone's choice to read, many in the world also love it very much. As we have said, when you read more every page of this russian engli...
Neural Networks for Sentiment Analysis in Czech
This paper presents the first attempt at using neural networks for sentiment analysis in Czech. Neural networks have shown very good results on sentiment analysis in English, thus we adapt them to the Czech environment. We first perform experiments on two English corpora to allow comparability with the existing state-of-the-art methods for sentiment analysis in English. Then we explore the e...
Syntactic Complexity of Russian Unified State Exam Texts in English: A Study on Reliability and Validity
In this study we analyze texts used in the Russian Unified State Exam (USE) in English. The texts, which formed small research corpora, were retrieved from two sources: the official USE database, used as a reference point, and “Neznaika” (https://neznaika.pro/), a popular website used by pupils for USE training. The sizes of the two corpora are balanced: USE has 11934 tokens and “Neznaika” has 11918 tokens. We share Biber’...
Sentiment Analysis in Scholarly Book Reviews
So far, different studies have tackled sentiment analysis in several domains, such as restaurant and movie reviews. However, this problem has not been studied in scholarly book reviews, which differ in terms of review style and size. In this paper, we propose to combine different features in order to be presented to supervised classifiers which extract the opinion target expressions and de...